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METHOD AND IMPLEMENTATION FOR DETECTING AND CHARACTERIZING 
AUDIBLE TRANSIENTS IN NOISE 

5 

Field of the Invention 

The present invention relates to identifying 
impulsive sounds in a vehicle, and more specifically, to a 
10 method and implementation for detecting and characterizing 
audible transients in noise. 

Background of the Invention 

15 Impulsive sounds are defined as short duration, high 

energy sounds usually caused by an impact. Examples of 
impulsive sounds include gear rattle, body squeaks and 
rattles, strut chuckle, ABS, driveline backlash, ticking 
from valve-train and fuel injectors, impact harshness, and 

20 engine rattles. Methods that can determine and predict 
the audible threshold of these impulse sounds, as well as 
identify their above-threshold characteristics, are 
important tools. The ability to predict thresholds is 
useful for cascading vehicle-level thresholds down to 

25 component- level thresholds, and ultimately, in developing 
appropriate bench tests for system components. 
Identifying the above-threshold characteristics is useful 
as a diagnostic tool for identifying impulsive sounds in a 
vehicle, and also for developing relevant sound quality 

30 methods . 

Three properties of a detection and classification 
algorithm are desired. The first is to detect different 
classes of impulsive sounds without having to subjectively 



tune algorithm parameters for each class. The second is 
to identify the temporal and spectral characteristics of 
the impulsive sounds. The final desired property is to 
correlate predicted thresholds with subjective detection 
5 thresholds. Existing algorithms do not satisfy all three 
properties. Current algorithms that identify temporal and 
spectral characteristics typically require subjective 
tuning of parameters for each class in order to correlate 
with subjective thresholds. Further, algorithms that 
10 automatically identify impulses in a sound do not 
characterize both the temporal and spectral content of the 
impulses . 

Correlation to subjective thresholds is largely due 
to processing the sound with a model of the auditory 

15 system. This provides the temporal and spectral data 
relevant to hearing. Most algorithms use wavelets or 
other time-frequency techniques, and as a result, it is 
difficult to generalize hearing properties to these 
models. Of the current algorithms that are based on 

20 auditory models, they require subjective interpretation of 
the temporal and spectral information to identify the 
impulsive sounds. 

It is therefore desired to have a method and 
implementation for detecting and characterizing audible 

25 transients in noise, specifically having automated 
interpretation of temporal and spectral information, and 
the ability to identify impulsive sounds over a large 
range of background sound levels . 

30 Summary of the Invention 

It is an object of the present invention to provide a 
method and implementation for detecting and characterizing 



audible transients in noise that overcomes the 
disadvantages of the prior art. 

Accordingly, the present invention advantageously 
provides a method and implementation for detecting and 
5 characterizing audible transients in noise including 
placing a microphone in a desired location, producing a 
microphone signal wherein the microphone signal is 
indicative of the acoustic environment, processing the 
microphone signal to estimate the acoustic activity that 

10 takes place in the human auditory system in response to 
the acoustic environment, producing an excitation signal 
indicative of the estimated acoustic activity, processing 
the excitation signal to identify each impulsive sound 
frequency-dependent activity as a function of time, 

15 producing a detection signal indicative of audible impulse 
sounds, processing the detection signal to identify an 
audible impulsive sound, and characterizing each impulsive 
sound. 

It is a feature of the present invention that the 
20 method and implementation for detecting and characterizing 
audible transients in noise has automated interpretation 
of temporal and spectral information, and has the ability 
to identify impulsive sounds over a large range of 
background sound levels. 

25 

Brief Description of the Drawings 

These and other objects, features, and advantages of 
the present invention will become apparent from a reading 
30 of the following detailed description with reference to 
the accompanying drawings, in which: 



FIG. 1 is a flow diagram showing the processing and 
detecting of impulsive sounds of the present invention; 
and 

FIG. 2 is a detailed flow diagram showing the 
5 psychoacoustic, detection, and characterization processes 
of the present invention. 

Description of the Preferred Embodiment ( s ) 

10 Referring to FIG. 1, a flow diagram 10 showing the 

processing and detecting of impulsive sounds of the 
present invention is shown. Flow diagram 10 includes two 
stages: an auditory model processing stage 12, and a 
detection and classification processing stage 14. 

15 Initially, auditory model processing stage 12 

receives a microphone signal 16 that is processed using a 
model of the human auditory system. Stage 12 then outputs 
twenty channels of data 18, where each channel represents 
frequency-dependent activity in the auditory system as a 

20 function of time. This output data 18 is processed to 
detect and characterize impulsive sounds. Examples of 
data from three channels 20 are shown, where traces have 
been offset vertically for viewing purposes. 

Detection and classification processing stage 14 

25 receives the data 18 from the auditory model processing 
stage 12. If an impulsive sound is detected, it is 
characterized by its time-of-occurrence and intensity. An 
example of detecting and characterizing two impulsive 
sounds 22 is shown. 

30 Referring now to FIG. 2, a highly detailed flow 

diagram 24 showing the auditory model processing or 
psychoacoustic model stage 12, detection and 
classification processing stage 14, and characterization 



process stage 26 of the present invention is shown. 
Psychoacoustic model stage 12 consists of the following 
phases: critical band filtering 28, extraction envelope of 
waveform 30, conversion to dB SPL 32, conversion to 
5 excitation levels in auditory system 34, and the 
psychoacoustic process of temporal masking 36. The 
detection and classification processing stage 14 consists 
of the following phases: compression 38, impulse 

detection 40, calculation of impulse magnitude 42, 

10 normalization of impulse magnitude 44, threshold impulse 
magnitude 46, combining impulses across critical bands 48, 
and detection rules for impulsive events 50. 

The psychoacoustic model stage 12 attempts to 
represent excitation levels, or acoustic activity, in the 

15 human auditory system. The first phase of processing 
sound in the auditory system is implemented by passing the 
sound through a bank of bandpass filters, known as 
critical band filtering 28. The remaining phases model 
non-linear processing in the auditory system, resulting in 

20 a time-frequency representation of the acoustic activity 
in the auditory system. 

In operation, critical band filtering 28 divides the 
microphone signal 16 into twenty equal signals. The 
microphone signal 16 is an electrical signal representing 

25 the acoustic environment, possibly containing transient or 
impulsive sounds. Critical band filtering 28 filters the 
divided signal to extract signals with different frequency 
content. Each critical band filter corresponds to a 
respective divided signal. Each filter is preferably 

30 derived from 1/3 octave filters. Each filter receives its 
respective divided signal to pass a signal of desired 
frequency content. 



Phase 30 then extracts the envelope of the waveform 
of the divided filtered signal. Then phase 32 converts 
the extracted envelope to decibel, or dB SPL. Phase 34 
then converts the extracted envelope to an excitation 

5 level corresponding to an excitation level used in the 
auditory system, also called specific loudness. Phase 36 
then temporal masks the extracted envelope, also called 
postmasking. Postmasking refers to the masking of a sound 
by a previously-occurring sound. Postmasking effects are 

10 caused by the decay of specific loudness levels in the 
masker. 

Phase 38 of the detection and classification 
processing stage 14 then compresses the output of temporal 
masking phase 36 of the psychoacoustic model stage 12. 

15 Compression 38 is done through log2{)- The output of 
temporal masking phase 36 is in units of sone/bark, which 
generally follows a doubling law. That is, if sound A 
generates x sone/bark in a particular critical band, then 
doubling the loudness of A will generate approximately 2x 

20 sone/bark in that critical band. Compression 38 through 
log2() allows for computing relative changes in the 
excitation level, independent of the absolute value. 

Phase 40 then detects the impulse of the compressed 
output signal from the compression phase 38. In the 

25 impulse detection phase 40, standard peak-picking 
algorithms are used. The peaks are selected such that 
they are the largest peaks within a neighborhood ranging 
from approximately 10-50 msec depending on the critical 
band center frequency. 

30 Phase 42 then calculates the magnitude of the impulse 

detected by phase 40. Both compressed and uncompressed 
magnitudes of each impulse are calculated by taking the 



difference between its peak value and a local minimum 
preceding the peak. 

Phase 44 then normalizes the impulse magnitude 
calculated by phase 42. The compressed impulsive 

5 magnitudes are normalized by their root-mean square (RMS) 
value within the critical band. 

Phase 4 6 then thresholds the normalized magnitudes 
from phase 44. The only impulses that are kept are the 
impulses that have normalized magnitudes greater than a. 

10 Empirically, a=2 results in satisfactory agreement of the 
algorithm with detection of transient sounds by listeners. 

Phase 48 then combines the impulses across the 
critical bands from the twenty divided signals. To 
combine the divided signals, phase 48 searches for time- 

15 alignment of impulses across the critical bands. In 
particular, at time t, phase 48 identifies the normalized 
impulses across all critical bands that are within a 
temporal window of 5 msec duration and centered at t. 
Phase 48 then computes the sum-of-sguares of the 

20 identified normalized impulses for time sample t. The 
square root of the result is set equal to Kn (t) . 
Similarly, for the corresponding uncompressed impulse 
magnitudes, phase 48 computes K^(t) . Each one of the 
events where Kn(t)>0 is labeled a potential impulsive 

25 event . 

Phase 50 then processes the potential impulsive 
events in accordance with the detection rule for 
identifying an audible impulsive event. In particular, if 
Kn(t)>3.Q and K^(t)>Q.2 then the potential impulsive event 
30 is labeled as an impulsive event. 

In the characterization process stage 26, each 
impulsive event from phase 50 of the detection and 



classification processing stage 14 is characterized by its 
time-of -occurrence, t, and by its intensity, Kn(t). 

While only one embodiment of the method and 
implementation for detecting and characterizing audible 
transients in noise of the present invention has been 
described, others may be possible without departing from 
the scope of the following claims . 



